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Abstract. We systematically study the boundaries of one-dimensional, 2- 
color cellular automata depending on 4 cells, begun from simple initial con- 
ditions. We determine the exact growth rates of the boundaries that appear 
to be reducible. Morphic words characterize the reducible boundaries. For 
boundaries that appear to be irreducible, we apply curve-fitting techniques to 
compute an empirical growth exponent and (in the case of linear growth) a 
growth rate. We find that the random walk statistics of irreducible bound- 
aries exhibit surprising regularities and suggest that a threshold separates two 
classes. Finally, we construct a cellular automaton whose growth exponent 
does not exist, showing that a strict classification by exponent is not possible. 



1. Introduction 

Cellular automata are simple machines consisting of cells that update in parallel 
at discrete time steps. In general, the state of a cell depends on the state of its 
local neighborhood at the previous time step. The earliest known examples were 
engineered for specific purposes, such as the two-dimensional cellular automaton 
constructed by von Neumann in 1951 to model biological self-replication [1]. Three 
decades later, researchers began to study entire classes of automata, such as the 
256 one-dimensional cellular automata that use k = 2 colors and that depend 
on d = 3 cells [H [S]- The behavior of these rules subsequently garnered much 
attention. Most studies have focused on the interiors of patterns generated by 
cellular automata, likely because the boundaries are well known and simple for 
the fc = 2, cZ = 3 rules, such as the three linear boundaries shown in Figure [T] 
However, boundaries of automata are diverse, often more predictable than interiors 
(and hence more amenable to mathematical study), and even useful for detecting 
interesting behavior. 
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Figure 1 . Three 2-color cellular automaton rules depending on 3 
cells, begun from a single black cell. Despite very different interior 
behavior, the boundaries all exhibit simple linear growth. 
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Our main purpose in this paper is to inventory the boundary growth of the 
2^^ ~ 65536 one-dimensional rules that use k = 2 colors and that depend on 
d = 4 cells. Several rules in this space have boundaries not found among rules 
with shorter range d. For example, some nested automata have piecewise linear 
boundaries characterized by morphic words, while more chaotic automata have 
boundaries that behave like random walks. 

Boundaries of cellular automata have been studied before. Phillips [1] studied 
the k ^ 2, d = 3 automata with periodic-background initial conditions, which is 
more general than the constant-background initial conditions that we consider here, 
and he found boundary growth rates that depend on the initial condition. In a live 
experiment in 2005, Wolfram |5] investigated the boundaries of A; = 2, d = 4 rules 
begun from simple initial conditions. Our paper can be seen as the completion of 
this experiment. 

Because of the large size of the rule space, we are particularly interested in mak- 
ing our inventory programmatically accessible so that it can be searched and com- 
puted with. The Mathematica package CellularAutomatonData [6^ provides 
an interface to all the data we accumulated both programmatically and by hand. 
The primary function in this package uses the same syntax as the data functions 
built into Mathematica, and a cellular automaton is denoted {{n, k, (d— 1)/2}, init} 
to parallel CellularAutomaton. For example. 



CellularAutomatonData [{{1273, 2, 3/2} , {{1}, 0», "GrowthRate"] 



retrieves the limiting growth rate of the k — 2, d — A rule number 1273 begun 
from the initial condition • • • I I I M I n • • • , which is 6/5. The package Cellula- 
rAutomatonBoundaries [7] contains code used to generate the data in Cellu- 
larAutomatonData [B]. These packages are available from the websites of the 
authors [6l[7]. 

Section [2] of the paper establishes our notation and reviews the boundary growth 
rates for 2-color cellular automata depending on at most 3 cells. In Section [3] we 
describe a search for 2-color cellular automata depending on 4 cells that exhibit 
reducible boundary growth, and we discuss boundaries found by this search. In 
Section |4] we address the automata that were not found to have reducible growth; 
we study their growth rates statistically using tools typically applied to random 
walks. We also attempt to assign a growth function t^ to each automaton for some 
< 6 < 1. However, in Section [5] we show that in general this is impossible, by 
constructing an automaton for which no such b exists. In Section [6] we discuss 
possible extensions and open questions. 

Classifying automata by their boundaries identifies many automata with inter- 
esting behavior. Many boundaries closely reflect the behavior of the interior. For 
example, nested boundaries arise from nested automata, while chaotic boundaries 
arise from complex automata. Some automata with complicated interiors (such 
as rules 30 and 110 in Figure [I]) nevertheless have simple boundaries. Thus the 
complexity of an automaton's boundary provides a lower bound on the complexity 
of its interior. Throughout the paper we describe many interesting automata found 
in this way by using the boundary as a filter. 
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2. Background 

2.1. One-dimensional cellular automata. The cellular automata that we study 
m this paper are one-dimensional. A one- dimensional cellular automaton consists 
of 

• an alphabet S of size k, 

• a positive integer d, 

• a function i from the set of integers to E, and 

• a function / from S'' (d-tuples of elements in S) to E. 

The function i is called the initial condition, and the function / is called the rule. 
We think of the initial condition as an infinite row of discrete cells, each assigned 
one of k colors. To evolve the cellular automaton, we update all cells in parallel, 
where each cell updates according to /, a function of d cells in its vicinity on the 
previous step. 

There are k'^ rules on k colors depending on d cells. We adopt the usual con- 
vention of naming a cellular automaton's rule by the number whose base-fc digits 
consist of the outputs of the rule under the fc"* possible inputs of d cells, ordered 
reverse-lexicographically. For example, the 2-color rule depending on 3 cells that 
maps the 8 possible inputs according to the table 
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is rule OOOIIIIO2 = 30 in this numbering. Here we have identified = and 1 = ■. 

The evolution of a one-dimensional cellular automaton can be visualized in two 
dimensions by displaying each row below its predecessor. For example, Figure [l] 
shows 2^ steps of three rules evaluated from the initial condition • • • I I I M I "M • • • . 
To create such pictures we must choose a horizontal offset. For instance, the offset 
of rule 30 in the table above is center-aligned: every cell depends on the cells in 
the same position, 1=1 position to the left, and r = 1 position to the right. For a 
different offset, the rows in the automaton will be the same; each row simply shifts 
with respect to the row preceding it. In other words, shifting I and r to I — A 
and r + A, respectively, only shears the two-dimensional picture. Therefore, for 
convenience we generally choose a horizontal offset that minimizes the total width 
of the region of interest. 

2.2. Row lengths. We require that all but finitely many cells in the initial con- 
dition have the same color. Then each row has finite length, which we define as 
follows. If all cells in a row are the same color, the length of that row is 0. Oth- 
erwise, the length of a row is the number of cells in the region bordered by, and 
including, the first and last cells that differ from the constant background. For a 
given cellular automaton, let £{t) be the length of the row on step t for each t >0. 

For example, the length £{t) for rule 90 begun from • • • I I I M I "M • • • as in Fig- 
ure[l]is e{t) = 2t+l for all t > 0. For rule 30 the length is also i{t) = 2t + l, whereas 
for rule 110 it is £{t) ~ t + 1. Note that £{t) does not depend on the horizontal 
offset chosen to display the automaton. 

At each step in a cellular automaton, information can propagate at most / steps 
from the right boundary and at most r steps from the left boundary, where / and 
r depend on the offset chosen but are subject to I + 1 + r = d. In other words, the 
maximum growth rate possible (called the "speed of light") is d — 1 cells per step, 
and if the maximum growth rate persists over time, then £{t) = (rf— l)t + c for some 
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Figure 2. Rules 45 and 107 have row lengths that can be ex- 
pressed by Equation [T] Rule 106 exhibits square- root growth when 
begun from two adjacent black cells. 



c. If the maximum growth is achieved at every step, then £{t) = (d — l)t + £{0) for 
all t > 0. 

Because each row in a cellular automaton depends only on the previous row, the 
difference sequence i{t -|- 1) — £{t) is particularly relevant, since it gives the number 
of cells by which the automaton grows or shrinks at each step. It will be useful to 
think of the difference sequence as an infinite word on the set of integers. 

Definition 1. The boundary word of a cellular automaton is the sequence {£{t + 
l)-£it)}t>o- 

We will see that the boundary word frequently reflects properties of an automa- 
ton. 

If the boundary word is eventually periodic, then £{t) can be written as a piece- 
wise expression in linear functions. Namely, there exist integers m, tnun and rational 
numbers a and Cq, Ci, . . . , Cm-i such that for all t > t^^i^ we have 



(1) £it) 



at + Co if t = mod m 
at + Ci a t = 1 mod m 

at + Cm-i a t = m — 1 mod m. 



For example, the sequence £{t) for rule 45 begun from • • • I I I M I Tl • • • , shown in 

Figure [2] is 1, 3, 4, 6, 7, The boundary word 212121 • • • is periodic with period 

length 2, and the length of the row at step t is 



£{t) 



3t/2 + l ift = mod 2 
3{t+l)/2 if<=l mod 2 

for t > 0. Rule 107 begun from a single black cell is also shown in Figure [2] its 
boundary word 121202020242024 • • • , where 4 = -4, is eventually periodic with 
period length 4, and for t > 7 



m 



All 2-color cellular automata depending on d = 2 cells have eventually periodic 
boundary words, either with growth rate a = or a = 1. 

The boundaries of 2-color cellular automata depending on c? = 3 cells are largely 
similar. These automata generate a variety of internal structures: rule 90, for 
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example, produces nested structure, while rules 30 and 110 yield complex behavior. 
One new feature seen for d = 3 is square-root growth, exhibited for example by 
rule 106 begun from the initial condition • • • IZlZlZ^BZlZn • • • , as shown in Figure [2] 



We discuss square-root growth further in Section 3.3 However, with the exceptions 
of rules 106, 120, 169, and 225, each 2-color cellular automaton depending on d = 3 
cells has an eventually periodic boundary word. Moreover, for every automaton in 
this space (with a constant-background initial condition), the limiting growth rate 

(2, f 

t-+oo t 

exists and is an element of {0,1,3/2,2}. In particular, for rule 106 this limit is 
0. (Note that if we allow a general periodic background for the initial condition, 
then the boundary word is not necessarily eventually periodic; for example, the left 
boundary of rule 30 begun from the initial condition • • • I M M M M I • • • = 

• • • 0101011101010 • • • appears to be chaotic.) 

In general, the limiting growth rate limf^oo^(0/^ of ^ cellular automaton may 
not exist, as we see in Sectionjs] Moreover, the limiting growth exponent limt_>.oo logj i{t) 
may not exist, as we show in Section [5] However, in most cases these values do 
appear to exist, so in Section [4] we use them as statistical information about bound- 
aries. 

We mention the observation of Phillips [J] that if the sequence of rows in an 
automaton is not eventually periodic, then £{t) grows at least logarithmically. This 
is because for £ > 2 there are k^~^{k — 1)^ possible rows of length £, so a cellular 
automaton that never returns to the same state has at most exponentially many 
rows of length £. Logarithmic growth is not seen for k = 2 and d < 3, and 
we did not find logarithmic growth among d = 4 rules either. However, it is 
possible to construct an automaton that implements counting in binary, by using 
additional colors (and additional steps) to propagate carries, and this automaton 
grows logarithmically [1]. 

3. Automata with reducible growth 

In this section we describe a combined automated-manual search for reducible 
boundaries among all 2-color rules depending on d = 4 cells, begun from single-cell 
initial conditions. Eventually periodic boundary words can be detected completely 
automatically, and we examine by hand the automata that are not found to have 
an eventually periodic boundary word. 

As in every space of cellular automaton rules, some rules in this space are equiv- 
alent to others by simple transformations. For example, reversing each tuple in the 
definition of the rule and reversing the initial condition results in an image that 
is simply the left-right reflection of the original. Similarly, permuting the colors 
in a rule and in the initial condition produces an image that is obtained from the 
original by the same permutation. Therefore it suffices to consider only one rule 
among each equivalence class of rules obtained by reflecting and permuting. For 
this we choose the rule with minimal rule number. For k — 2 and d = 4 this reduces 
the number of rules from 65536 to 16704. 

As a simplifying assumption, we consider only the two initial conditions • • • I I I M I Tl • 
and • • • ^^EI^^H • • • , each consisting of an infinite constant background with a sin- 
gle perturbed cell. This results in 33408 automata (two initial conditions for each 
rule). In many cases these initial conditions suffice to characterize the growth of 
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the rule. However, for rules that grow dramatically differently depending on the 
initial condition, the data we collect may not be representative of typical growth. 

We further restrict the initial condition by requiring the background color to 
reoccur on some later step (but not necessarily the next step). That is, we only 
consider the initial condition • • • I I I M I "Tl • • • if a white background reoccurs on 
some later step. Similarly, we only consider • • • ^^EI^^H • • • if a black background 
reoccurs on a later step. We ignore these initial conditions otherwise because we 
are interested in long-term behavior, and a background that does not reoccur is a 
type of transience. Doing so reduces the number of automata to 25088. 

We run each of these automata for tmax steps and consider the difference sequence 
£{t+l)—£{t) for tnun < ^ < imax~l, with some tmin > allowing for transience. Let 
TO be the smallest positive integer such that £{t + l + m) — £{t + to) = €(i + 1) — £{t) 
for all ifnin < t < ^inax — 1 — m. If TO < (tmax ~ ^min) /4, then wc dccm the boundary 
word to be eventually periodic (and £{n) to satisfy Equation [T]) , and we record the 
period length to and the growth rate 

sum of the terms in the period £{t + m) — £(t) 



Otherwise we consider the period length unreliable and this test inconclusive. 

In choosing a time range tmin ^ t < imax on which to test periodicity, we face 
opposing goals: To overcome possible transience, we want t^-m to be large, but for 
speed we want imax to be small. Our solution is to use the four short time ranges 
20<t< 100, 50 <t< 300, 200 < t < 600, and 400 < t < 1000 as successive fihers, 
followed by a more extensive range. If a reliable period length is found in any of 
these ranges, then we skip the remaining ranges and compute the period length in 
a final range 500 < t < 4000 to confirm that the period length persists. This final 
time range identifies only 32 corrections to period lengths found by one of the first 
four ranges, and all but one of those (correcting the slope from 7/4 to 18/11 for 
rule 23726 begun from • • • I I I M I n • • • ) are cases where the boundary word does 
not appear to be eventually periodic. Running all 25088 automata through the four 
filter ranges took approximately twenty minutes on a 2.5GIIz machine. Running 
the final time range took approximately two and a half days. 

While we believe that confirming each period length in the range 500 < t < 
4000 has allowed our data to be highly reliable, this algorithm clearly does not 
guarantee that if a period length was found then the boundary word is in fact 
eventually periodic (false positives), nor does it guarantee that if a period length 
was not found then the boundary word is not eventually periodic (false negatives). 
There are several automata whose boundaries do not stabilize until well after 400 
steps or whose eventual behavior is unclear. For example, rule 11109 begun from 
• • • I I I M I n • • • grows to ^(1722) = 918, and thereafter has average growth rate 
0. Rule 4713 begun from • • • I I I B I □ • • • jettisons a particle at step 915, leaving 
behind an otherwise chaotic left boundary. Rule 10633 (begun from either initial 
condition) appears to have an eventually periodic boundary word due to its internal 
froth generally moving away from the boundary, but it is not clear that this will 
continue indefinitely. Worse, there are automata whose growth is periodic for short 
time intervals but that are most likely not periodic in general. For example, rule 457 
begun from either initial condition has a boundary word that is periodic in the range 
100 <t< 200; but for larger ranges we see that the periodicity does not continue. 
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These examples indicate that in general one cannot determine the long-term 
behavior of the boundary of a cellular automaton by examining finitely many steps. 
Of course, this is not surprising, because the boundary can depend sensitively on 
the interior of the automaton, and it is known that some cellular automaton rules 
are computationally universal. Indeed, we determined the four time ranges only 
after some experimentation with a selection of rules. 

Executing this automatic search yields 837 automata (with 620 distinct rules) 
that were not found to have an eventually periodic boundary word. Among these 
837, there are only 757 distinct pictures (at least for 500 rows), because several 
pairs of inequivalent rules appear to nonetheless generate the same evolution due to 
certain configurations not appearing. We examined each of these classes manually 
and found that 36 automata do in fact appear to have eventually periodic boundary 
words, while another 81 exhibit self- similarity. Therefore a classification of the 
25088 automata is as follows. 

(1) 24287 automata have eventually periodic boundary words. 

(2) 81 automata have boundary words that are not eventually periodic but are 
reducible. 

(3) 720 automata have boundaries that are most likely not reducible. 

Analyzing automata in the third class is the subject of Section |4] Automata in the 
first two classes have boundary words with simple descriptions, and they are the 
subject of this section. 

A note regarding the level of rigor is in order. We do not formally prove the 
claims in this section, neither the explicit growth rates nor other properties we 
describe. Therefore they can either be taken as conjectures or as semi-rigorous 
results that are experimentally verified for the first 4000 (and in some cases many 
more) steps of the cellular automata involved. Proving each claim is beyond the 
scope of the current paper, although we touch on this in Section |6] 

3.1. Eventually periodic boundary words. In the first class of 24287 automata — 
those with eventually periodic boundary words — the most common average growth 
rate is 0, and there are 11768 automata with growth rate 0. The following table 
gives the thirty most common growth rates a (as in Equation [T]) and the number 
Na of automata with each rate. 
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Na 


a 
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Na 
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Na 
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11768 


5/4 


102 


15/13 


18 


9/7 


11 


8/7 
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3 


4800 


5/3 


73 


9/4 


17 


10/7 


10 


13/8 
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4001 


6/5 


53 


9/5 


17 


7/6 


10 


11/10 


7 


1 


1082 


7/4 


45 


7/5 


17 


15/14 


10 


14/11 


6 


5/2 


985 


3/4 


40 


5/6 


15 


1/2 


10 


7/8 


6 


3/2 


951 


4/3 


28 


11/8 


11 


9/8 


9 


2/3 


6 



The smallest nonzero growth rate is 2/5, and 5 automata have growth rate 2/5. 

If r/s is a non-negative rational number written in lowest terms (with gcd(r, s) = 
1), let us define the height of r/s to be max(r, s). The height of a number is 
one measure of its complexity. The automaton whose limiting growth rate has 
largest height is rule 10168 begun from a single black cell, with growth rate a = 
1578/1013. This automaton also has the largest period length: 2026 steps. The 
next largest-height growth rates that occur are 773/411, 515/318, 398/247, 329/199, 
and 297/127; all these automata have fairly simple interiors. 
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The growth rate with largest height that is generated by two distinct rules 
(not two distinct automata that share the same rule but two distinct rules) is 
91/55. The rules are 17380 and 46236, respectively begun from • • • I I I M I n • • • 
and ' ' ' ^HZ^H - - - . 

3.2. Morphic words. The remainder of Section[3]concerns automata in the second 
class mentioned above: automata with boundary words that are not eventually pe- 
riodic but still amenable to short description. These automata all have boundaries 
that exhibit nontrivial self-similarity, so we may refer to these as fractal bound- 
aries. It turns out that the boundary words for all these automata are morphic 
words — words generated by iterating a morphism (also known as a substitution 
system). 

Let S and A be finite alphabets, and let S* denote the set of all finite words 
with letters in S. The empty word is denoted by e. For a function ip : ^ A* 
and a (finite or infinite) sequence wq, wi, . . . of letters in E, define (p{woWi • • • ) = 
ip{wo)ip{wi) ■ ■ ■ . We refer to </? as a morphism, since f{xy) = ip{x)ip{y) for all words 
x,y. If A = E and there is some letter A e E and some word a; e E* such that 
ip{A) — Ax, then by iteratively applying ip to A we obtain prefixes of the word 

(p'^{A) := Axip{x)ip'^{x) ••• , 

which is a fixed point of ip. Moreover, this word is the unique fixed point of p 
beginning with A. An infinite word (or, equivalently, an infinite sequence) w is 
morphic if there is a letter A g E and morphisms </? : E — > E* and tp : T, A* 
such that 

W = ^(VP'^(A)). 

We see in the following subsections that, for each cellular automaton with reducible 
boundary structure, the boundary word is morphic (and is a word on some finite 
set A of integers) . 

In the next three subsections we address fractal automata whose limiting growth 
rates exist. We will see that these rates do not approach the complexity of some of 
the growth rates observed for eventually periodic boundary words in Section [XT] In 
the final subsection we discuss automata whose limiting growth rates do not exist. 
(Many of the rules discussed have nearly identical behavior when begun from the 
two initial conditions; in these cases we only discuss one initial condition without 
mentioning this further.) 

We refer the reader to the book of Allouche and Shallit [51 Chapters 6-8] for a 
comprehensive treatment of morphic words. For our immediate purposes, it suffices 
to mention that prepending a word to a morphic word produces another morphic 
word. In particular, every eventually periodic word is morphic. 

3.3. Square-root growth. Before discussing square-root growth among 2-color 
rules depending on 4 cells, we first discuss cZ = 3 rule 106, which also exhibits 
square-root growth. Figure [2] shows the evolution of rule 106 begun from two 
adjacent black cells. The boundary word of this automaton is the infinite word 

W106 = 11010011000000010000000011010011 • • • 

on the alphabet {0, 1}. Let us rewrite the boundary word as 

wio6 = l^OH^O^l^O^l^O^l^OH^O^l^O^H^O^^ • • • , 
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since the run lengths of each block suggest a pair of morphisms that generate Wiog- 
In particular, observe that replacing each in wioe by 0"* causes 0^ to become 
0^2. So that O'^l^O* O^^l^O^^ we need 1 0001; however, not every 1 can be 
replaced using this rule, since this would result in no instances of 1^ in the fixed 
point. Therefore we introduce some additional letters. Consider the morphism 

(p^{A^ ABCD, B CCAB, C CCCC, D CCCD}. 

The fixed point ip'^(A) of this morphism is 

A'B'C^D^C^A'B'C'D'C'A'B^C'D'C^A'B'C^'D'C''^ ■■■ . 

Applying the morphism ip — {A — > 1,B — > 1,C — > 0,13 — > 1} to this fixed point 
gives wio6 = ipif'^iA)). 

From this morphism one can derive that rule 106 grows like ^/t. Here we show 
a weaker claim — that 1/2 is a limit point of the sequence log(^(i). Letting E — 
ABCDCCAB and F„ = C"^'^"-^ DC'^-^" , one can check that 

r(^)=( n EF,^(k)+^EC^"^'"-^D for a > 3, 
^ fe=i ^ 

where i'2{k) is the exponent of the highest power of 2 dividing k. Using v^ik) to 
count occurrences of E and in 4>°'{A) preceding ^^D gives 

22.-1 

£(2^"-^) = ^(0) + wio6(i) = 3 • 2"-^ + 1 for a > 1. 

This agrees with the observation of Gravner and GrifFeath [5] that the configuration 
at step 22"-i is 

■■■ I I I M I ]. . .l I ■ I I I ... 

Including the trailing C'^'""^~^D in (j)°'{A) leads to t{2'^°') = 3 • 2"-^ + 2 for a > 1. 

Among 2-color rules depending on 4 cells, two inequivalent rules exhibit square- 
root growth from single-cell initial conditions: rules 34394 and 39780. Although 
they are not equivalent as rules, the automata obtained by running these rules from 
a single black cell are equivalent under left-right refiection, since the tuple on which 
the rules differ does not occur in the evolution begun from a single black cell. In 
particular, rule 39780 is known to exhibit conditional reversibility, due to the local 
rule being a bijective function in the leftmost position [TU], whereas rule 34394 does 
not have this property. Figure |3] shows rule 39780. 

For both of these automata, the boundary word is 

W39780 = 22102211lllil0221022llllllll • • • , 

a word on the alphabet {—1,0, 1,2}, where we have written 1 for —1. Because of 
the repeating 11 oscillations, the run lengths of the original sequence do not reveal 
much. However, partitioning into blocks of length 2 as 



W39780 = (22)1(10)1(22)1(11)3(10)1(22)1(10)1(22)1(11)15 

(10)1(22)1(10)1(22)1(11)3(10)1(22)1(10)1(22)1(11) 
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39780 7l95 " " 27S9S 



Figure 3. Rule 39780 grows like y/i. The other four automata 
pictured here contain oscillating particles. 

shows some structure. If ip is the morphism 
{A ABC, B DAB, 

C CECE, D CECD, E CECE] 

^ {A ^ 2,B ^ 2,C ^ l,D ^ {),E ^ I}, then W39780 = 
To show again that 1/2 is a hmit point of \ogtt{t), let F = D ABC D ABC and 
G{n) = {ECy\ Then for a > 3 

(A) = \[ FG{2^'''^^''^ -1)JfC{2^"-'^ -l)E, 

where again 1^2 (^) is the exponent of the highest power of 2 dividing k. Counting 
occurrences of F and G(n) in D(f)"{A) and computing their respective lengths and 
contributions to the boundary, we obtain 

£(4"-i/2 + 2"-i) = 5 • 2°-^ for a>2. 

Note that the morphism ip for rule 106 is 4-uniform. Consequently, the sequence 
W106 is 2-automatic (meaning that there is a finite automaton that outputs the tth 
term when input the binary digits of t); it follows that £{t) is 2-regular in the sense 
of AUouche and Shallit [TTl [T^] and therefore can be computed quickly. On the 
other hand, the morphism ip for rule 39780 is not uniform, and indeed it appears 
that the sequence £{t) for this automaton is not fc-regular for any small value of k. 

3.4. Oscillating particles. Four rules have boundary words that are nearly peri- 
odic but that are perturbed occasionally by particles that oscillate in the interior 
of the automaton. They are shown in Figure [3j 

First consider rule 3701 begun from a single black cell. The right boundary is 
not perturbed when the particle reflects off of it, but the left boundary is perturbed 
at steps (3 • 5" + 5)/4 — a for a > 0. However, since the step numbers of these 
perturbations decay exponentially, they do not impact the limiting growth rate, 
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SO the limiting growth rate is 1. The boundary word is generated from A by the 
morphism ip = {A ^ AB,B SC^ C C^} fohowed by ^ = {A ^ e,B ^ 
30, 31}, where 1 = -1. 

Rule 8067 begun from a single black cell is similar, with a single particle per- 
turbing the left boundary at steps (8 • 7"+^ + 6a — 2)/9. However, the particle also 
perturbs the right boundary when it reflects at steps (20 • 7" + 6q; + 79)/9. 

Rule 7195 begun from a single black cell contains additional internal structures, 
but the net effect is that a single particle oscillates between the left and right 
boundary, with the rest of the structure remaining close to the right boundary. 
The particle in fact does not perturb the right boundary when it reflects, although 
it does perturb the left boundary. 

Rule 27898 begun from a single black cell differs in two ways from the others. 
The oscillating particle does not traverse the entire interior width of the automaton 
but reflects off an internal boundary. Additionally, the "particle" at times looks 
more like a group of particles, and not every interaction with the boundary is 
identical. However, after four reflections the particle returns to its original state, 
so the oscillatory behavior is in fact simple. 

The respective limiting growth rates for rules 8067, 7195, and 27898 are 6/5, 
5/4, and 3/2. Although we do not determine the morphisms here, the regularity of 
the oscillations in these automata imply that the boundary words are morphic. 

3.5. Two automata v^rith limiting growth rates. Figure |4] shows rule 1273 be- 
gun from a single black cell and rule 36226 begun from a single white cell. The 
boundary words for these automata are not eventually periodic, but they are mor- 
phic. Moreover, the limiting growth rate Mmt^oD ((t) / 1 (Equation [2]) exists for 
each. 

We begin with rule 36226 because its boundary is simpler. On a global scale 
this automaton exhibits nested structure similar to that produced by d = 3 rule 90 
begun from a single black cell (see Figurejl]). However, the right boundary is fractal. 
The boundary word 

W36226 = 12211221221112212211221221111221 • • • . 

can be obtained by dropping the first two letters in the fixed point 2212211 • • • of 
the morphism ip — {1 ^ 1,2 221}. Recalling that i'2{n) denotes the exponent 
of the highest power of 2 dividing n, we can also write 

W36226 = n 

n>2 

The limiting growth rate of the automaton is determined by the frequencies of 1 
and 2 in W3g226- The frequency of a letter x in an infinite word wqWi • • • is 

lim ~ ~ ^. 

t-^oo t 

To compute the letter frequencies, we examine the incidence matrix of ip, which 
records for each pair of letters x,y the number of occurrences of x in ip{y). The 
incidence matrix for tp is 

h 1" 
2 ■ 

If the frequency of each letter in a morphic word {A) exists, then the vector whose 
components are the letter frequencies is an eigenvector of the incidence matrix 
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Figure 4. Rows through 2^ - 1 of rule 1273 and rule 36226, 
where the limiting growth rates have been used to shear the images 
such that the nonperiodic boundaries are vertical. The colors of 
rule 36226 have been reversed to place it against a white back- 
ground. 

corresponding to the largest positive eigenvalue (8, Theorem 8.4.6]. In the case of 
W36226, the letter frequencies exist, and that vector is (1/2,1/2). Therefore the 
letters 1 and 2 occur with equal frequency, and on average the automaton grows 
3/2 cells per step. 

Now consider rule 1273 begun from a single black cell. The interior is also 
nested, although the ncstcdncss is not as obvious visually. For this automaton, the 
boundary word 

W1273 = 31303031313130303131313030313030 • • • 
(where again 1 = —1) is given by {3iy{30)'^ilj{(p'^{A)), where 

ip^{A^ AC, B ^ AD, C BA, D BB}, 

and ip maps 

A (3i)=^(30)2(3i)3(30)2(31)i(30)2 
B^ (31)3(30)2(31)^(30)2 
C ^ (31)5(30)2(31)3(30)2(31)1(30)2 
(31)5(30)2(31)5(30)2. 

The incidence matrix for ip is 

"1110" 

12 

1 0' 
10 

so the vector with components equal to the frequencies of the four letters A, B, C, D 
is (4/9,2/9,2/9,1/9). The letters A, B, C, and D correspond to respective net 
changes of 32, 28, 36, and 32 cells over 26, 24, 30, and 28 steps, and so one computes 
that the limiting growth rate is 6/5 cells per step. 
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3.6. Automata with no limiting growth rate. Finally, a number of automata 
have linear growth in the sense that the limiting growth exponent linif_i.oo logj £{t) 
is 1 although the limiting growth rate limt^oo £{t) / 1 does not exist. 

As a typical example, consider rule 2230 begun from a single black cell. The 
boundary word is 

W2230 = 2i0^23022^0''2i20**2240i*' • • • . 

Replacing — ?> 00 and 2 — > 22 produces W2230 again, with the exception of the first 
three letters 202. In other words, the structure of W2230 is that of the fixed point 
beginning with A of the morphism ip = {A ^ ABCB, B -> BB, C -> CC}: 

^'^{A) = A^B^C^B^C^B^C^B^'^C^B^^C^'^ ■■■ . 

Applying %p = {A ^ e, B ^ 2,C ^ 0} produces W223o- 

The frequencies of the letters B and C in (p'^(A) turn out to not exist: The 
frequency of B in the first 4 • 2" - 2 letters is (3 • 2" - 2)/(4 • 2" - 2), and the 
frequency of B in the first 5 • 2" - 2 letters is (3 • 2" - 2)/ (5 • 2" - 2). Since 3/4 and 
3/5 are both limit points oi \{Q < i < t — 1 : Wi — B}\/t, the frequency of B does 
not exist. Similarly, the frequency of C does not exist. 

Consequently, the frequencies of and 2 do not exist in W22307 ^^id the growth 
rate limt^ao £{t) /t does not exist. However, the growth can still be quantified by 
Oinf '■— liuiiid £{t)/t = 6/5 and Osup ■= limsup£(t)/t — 3/2. 

Several other automata have boundaries that are also generated by the morphism 
ip — {A^ ABCB, B — )• BB, C — >■ CC}, followed by some morphism ip- The values 
flinf and Qsup can be computed for these automata as well. Representatives are 
shown in Figure [5j and bounds on their growth are given in the following table. 



rule 


initial condition 






V'(C) 


ftinf 


^sup 


2230 


...1 1 1 ■ 1 1 1... 


e 


2 





6/5 


3/2 


3283 


...1 1 1 ■ 1 1 1... 


21 


2121 


22 


3/4 


6/7 


6681 


...1 1 1 ■ 1 1 1... 


31 


3231 


33 


15/16 


15/14 


10155 




2121 


1121 


11 


15/16 


15/14 


10389 




32 


3232 


33 


9/8 


9/7 


10389 




230 


3030 


33 


9/8 


9/7 


10644 




2 


12 





9/8 


9/7 


11032 




12 


12 





9/8 


9/7 


37018 




2 


02 





3/4 


6/7 


39066 




1 


11 





3/4 


6/7 


39394 




12220 


12220 


00 


21/19 


21/17 


41114 




22 


02 





3/4 


6/7 



Three additional morphisms ip generate the boundary words of automata with 
no limiting growth rate. 

For rule 15268 begun from a single black cell, the boundary word is W15268 — 
'>p{ip"{A)), where 

(p^{A^ ABC, B BB, C ^ CC} 
iP = {A^ 220, B ^ 12, C ^ 00}. 

The extremal limit points are a-mf = 3/4 and Ogup ~ 1. 
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Figure 5. Some nested automata with boundary words gener- 
ated by the morphism {A ABCB,B BB,C CC}. They 
are variants on a common underlying structure, for which the hm- 
iting growth rate does not exist. 
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For rule 4334 begun from a single black cell, the morphisms are 

ip = {A^ AEDBB, B BB, C ^ CC, D ^ DB, E EC} 

i})^{A^ 122, B ^ 22, C 00, £1 ^ 12, E 10}, 

and we have Oinf = 6/5 and Cgup = 3/2. 

For rule 11172 begun from a single black cell, the morphisms are 

^ = -> AEDB, B BB, C ^ CC, D -> DB, E EC} 
= {A ^ 2, B ^ 21, C ^ QQ, D ^ Q2, E ^ 2l}, 

and we have Oinf = 3/4 and a^up = 1- 

4. Automata with irreducible boundaries 

Among the 25088 equivalence classes of A; = 2, d = 4 cellular automata, 720 
automata evaded all attempts to reduce their boundaries. Among these 720, there 
are only 688 distinct pictures, since some pairs of inequivalent rules appear to 
generate the same evolution. In this section we first comment on the variety of 
unpredictable behavior found among these boundaries, and then we use tools from 
Brownian motion to study them more quantitatively. 

4.1. Qualitative taxonomy. Dependence on a fourth neighbor [d — 4) permits 
kinds of irregular boundaries that do not occur for the smaller neighborhood d = 3. 
Here we attempt to qualitatively survey the different behavior. Some automata, 
in spite of their chaotic-looking interiors, have stable-looking boundaries, but their 
chaotic interiors likely prevent the boundaries from stabilizing. The growth of these 
boundaries may represent an average of the input from the interior. Examples 
include rules 2020, 2717, 3223, 3493, 5267, 6116, 6773 (begun from one black cell) 
and 5603 and 5881 (white cell); Figure [6] shows rule 2020. Some of these boundaries, 
such as 6773 from black, are periodic for thousands of time steps at a time, but the 
chaotic interior seems to perpetually break the boundary's reducibility. 

Even more exotic and nontrivial boundaries exist. For instance, rules 5673, 6629 
and 7721 begun from a single black cell appear to have the rare property of rough 
boundaries on both sides. Rule 7077 (from black) jettisons a particle at the speed 
of light to the left, which the slower, apparently chaotic boundary cannot catch. 
In other cases, interior rather than exterior particles dominate the behavior of the 
boundaries. For instance, rule 7379 (from either black or white) jettisons diagonal 
patterns from the left boundary that run nearly parallel to it, while rules 8144 
(from black) and 11237 (black or white) create particles that collide with the left 
boundary at a more oblique angle, which indicates that growth of boundaries may 
depend on delicate, internal patterns. The nonperiodic internal structures of these 
automata come remarkably close to the left boundaries; the internal structures seem 
to persist, causing the boundaries to be nonperiodic. 

Most of these boundaries grow with significant average velocity near the speed 
of light {d~l cells per time step). Others grow as slowly as 0.02 cells per time step 



(see Section 4.3 1. For instance, rule 46728 from white (shown in Figure [6| has an 
internal boundary resembling a lazy random walk that occasionally collides with 
the (otherwise straight) external boundary. To quantify descriptions like these, 
we next study the 688 unpredictable boundaries by treating them like Brownian 
motion. 
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46728 7077 11237 

Figure 6. Examples of qualitatively different kinds of bound- 
aries conjectured to be irreducible. Clockwise from the top: a 
chaotic interior with a rather stable boundary that is periodic for 
thousands of time steps at a time (2020); rough boundaries on both 
sides (6629); interior particles collide with (and likely prevent the 
reducibihty of) the boundary (11237); a light-speed particle out- 
runs a slower, more chaotic boundary (7077); an internal boundary 
resembling a lazy random walk that occasionally hits the (other- 
wise straight) external boundary (46728). 



4.2. Random walk statistics. To draw an analogy between unpredictable bound- 
aries and random walks, we note that the average growth t{t)/t and variance of the 
difference sequence l{t -I- 1) — (-{t) of boundaries of cellular automata are analogs of 
the drift and diffusivity of the Brownian motion of molecular motors [13] . In light 
of this parallel, we define the drift U to be the average growth rate 

U = hm i{t) It 

and the diffusivity D to be the variance of the difference sequence 

D ^ lim Var(£(l) - ^(0), ^(2) - £{1), . . . , £(i + 1) - £{t)). 

Continuing the analogy with molecular motors, we define a Peclet number for 
boundaries of automata to be the ratio of the drift and diffusivity. 



The Peclet number Pe measures the coherence of the boundary [12]: a large Pe 
indicates nearly deterministic movement in a clear direction, whereas a small Pe 
indicates a meandering, noisy trajectory. Its inverse r — 1/Pe is the randomness of 
the boundary [13]. 

In Figure[7]wc plot the distributions of the four random walk statistics (C/, D, r, Pe) 
of the 688 irreducible boundaries. Sorting and plotting these on log-linear scales 
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shows that U, D,r, Pe decay approximately exponentially over two orders of mag- 
nitude among the 688 irregular boundaries. This observation, and others in this sec- 
tion, are robust to changes in the number of time steps tmax G {500, 1500, 5000, 10000} 
of evaluating the automataj^ The data stored in CellularAutomatonData [5] 
is for tinax = 10*, and we show these results throughout this section. 

We also fit the boundaries to linear {at + c), power law {at^ and at'' + c), and 
logarithmic (alog(6t) +c) functional forms. To select the "best fit" that maximizes 
the (for accuracy) and that minimizes the Akaike Information Criterion (AIC) 
(for parsimony) [2], we choose the fit that maximizes ■ exp((AICmin — AIC)/2), 
where AlCmin is the minimum AIC among all models il4j . 

As expected, for reducible boundaries, the slope a of the linear fit at + c approxi- 
mately equals both the empirical estimate of the drift, ^(imax)/^maxi smd the growth 
rate a in Equation [T] computed using Equation [3] For boundaries conjectured to 
be irreducible, the slope a of the linear fit at + c is nearly equal to the empirical 
estimate of the drift (for t^ax = 10*, a and U differ by just 0.002 ± 0.004). 

Irrational limiting growth rates are known to exist for cellular automata that 
compute powers of integers in a certain base |15[ pages 613-615]. However, we 
did not recognize by visual inspection any irrational numbers among the growth 
rates of the irregular boundaries, which suggests that they do not exist for k — 2, 
d = A rules. Recognizing exact irrational growth rates is difficult, since one expects 
^(imax)/imax for tmax = 10* to agrcc with thc limiting growth rate for at most four 
or five digits. 

No boundaries were deemed best fit by the logarithmic functional form, but 
190 of the 688 irregular boundaries were deemed best fit by a power law. The 
exponents h of these power laws all lie in the interval [0.85,1.17], except for the 
two slowest-growing boundaries, 7403 and 7419, both begun from a black cell (with 
exponents b = 0.03 and 0.01). (For more on the slowest-growing boundaries, see 



Section 4.3 ) We reject power law fits with exponents — 1| < 0.01, because these 
are more reasonably deemed linear fits. We conclude that nearly all the boundaries 
that grow as power laws have exponents near 1. Exponents above 1 occur when 
the parameter a < 1, which cannot be accurate for sufficiently large t because 
£(t) < 3t + 1 for all d = 4 automata begun from a single-cell initial condition. 
Neither adjusting tmax nor dropping tens or hundreds of the first boundary lengths 
(to allow for a transient) eliminates the power law exponents larger than 1. This 
illustrates the diflticulties of fitting irregular boundaries to functional forms using 
standard nonlinear fitting algorithms. 

The drift U and diffusivity D characterize what kinds of random walks these 
irregular boundaries behave like. Notably, one quarter of the 688 automata have 
diffusivity 0.15 < D < 0.25, which creates a "knee" in Figure [7] For comparison, a 
simple random walk with steps 1,-1 occurring with probability p, 1— p has variance 
0.25 for p = i (2 - ^/3) « 0.067. Such a random walk moves rather coherently in a 
certain direction, reflected by its large Peclet number Pe 2y/3 « 3.5 that is also 
common among the irregular boundaries. 

Turning our attention to thc drift U and diffusivity D of all 688 irregular bound- 
aries, we find a gap in the scatter plot of D and U in Figure [Sj This gap suggests 



^The values of U, D for almost every automaton change little from calculations up to time 
imax = 5000 to calculations up to time tmax = 10000 (e.g., 2/3 of the diffusivitics change by 
< 0.01, while 90% change by < 0.05). 



18 



CHARLES D. BRUMMITT AND ERIC ROWLAND 




Figure 7. The four random walk statistics (drift U, diffusivity 
Z?, randomness r, Peclet number Pe) of the 688 irreducible bound- 
aries decay approximately exponentially when sorted in decreasing 
order. Dashed lines approximate the slopes on log-linear scales. In 
the rightmost plot, we sort and plot the exponents of the power 
law fits for the 190 boundaries deemed to be better fit by a power 
law {at^ or at'' + c) than linear or logarithmic; not shown are the 
exceptionally small exponents b = 0.03 and 0.01 of rules 7403 and 
7419. 




u 



Figure 8. An unexpected gap in the relationship between diffusiv- 
ity D and drift U (computed for 10** steps) suggests a threshold ex- 
ists in the behavior of irreducible boundaries of cellular automata: 
they either grow erratically and quickly or more deterministically 
and slowly. 



the existence of a threshold: irreducible boundaries of automata either grow quickly 
and erratically (large U,D; upper-right region of Figure |8]) or more slowly and de- 
terministically (small U, D; lower-left region of Figure [8|. This scatter plot and its 
gap do not change qualitatively for different numbers of time steps imax- 

4.3. Slow growth. Fast boundary growth is common: the mean growth rate 
among the boundaries conjectured to be irreducible is large, (U) ~ 1.27. Slow 
growth, by contrast, is delicate and rare (see the sparse region U < 0.5 in Fig- 
ure [s]). Table [l] shows the ten automata that grow most slowly among the 688 
automata with apparently irreducible boundaries. The last column depicts the 
initial terms of the sequences i{t). 

The very slowest automaton (at least in the first 5000 steps) is rule 7403 begun 
from • • • I I I M I T1 • • • , shown in Figure [9j which does something quite surprising. 
Its boundary continues to grow slowly for more than half a million steps, reaching 
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rule initial condition drift U l{t) for t < 500 



7403 
7419 
2295 
2295 
11411 
11411 
38538 
34490 
34458 
1690 



I I I ■ I I I 
I I I ■ I I I 
I I I ■ I I I 



I I I ■ I I I 



I I I ■ I I I 
I I I ■ I I I 
I I I ■ I I I 



0.021404 
0.023805 
0.210042 
0.210042 
0.230046 
0.230446 
0.233647 
0.264053 
0.266853 
0.296859 




Table 1. The ten slowest of the presumably irreducible bound- 
aries (in the first 5000 time steps). 



only £(524557) = 174. After step 524557 the growth increases dramatically, reach- 
ing length 277 at step 525000 and length 36819 at step 10^. So while the average 
growth rate for the range < t < 500000 is 0.000348, the average growth rate for 
500000 <t<10^ is 0.073290, as if for some reason a growth rate as low as 0.000348 
is not sustainable. Figure [9] (bottom) and Figure 10 show the point at which the 



growth rate changes. We have no explanation for this behavior. 

The boundary of rule 7419 begun from • • • I I I M I n • • • (also shown in Figure[9]) 

also exhibits extended slow growth. Unlike rule 7403, however, its growth rate does 
not appear to suddenly increase. Due to its relatively short rows, one can quickly 
evolve it for a large number of steps. For example, we compute £{10^) — 271, and 
one suspects that the growth of this automaton is not linear in general but is better 
modeled by at^. We have no explanation for this continued slow growth either. 

We remark that the pictures generated by rules 7403 and 7419, shown in Fig- 
ure [9] resemble each other significantly. They largely consist of vertical lines, with 
structure reminiscent of counting in binary. Further work should be undertaken to 
understand these rules and to determine the extent to which they are reducible. 

4.4. Potential for universality. Rule 7555 is interesting as a potentially pro- 
grammable rule and hence a candidate for universality. Begun from either initial 
condition, the picture that rule 7555 generates strongly suggests that it performs 
some kind of arithmetic, with clear particles of varying slopes at times passing 
through each other and at other times interacting. Its left boundary depends sensi- 
tively on the computations being performed in the interior, and, for example, after 
changing position thirteen times in the range 10000 < t < 20000 when begun from 
• • • ^^EI^^H • • • , it remains constant for more than 3000 steps beginning at step 
20555. 

5. An AUTOMATON WITH NO GROWTH EXPONENT 



In this section we show that it is not possible in general to assign a growth 
function to a cellular automaton. In particular, we construct an automaton such 
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Figure 9. Top: Rules 7403 and 7419 begun from 
• • • I I I M I n • • • , the slowest-growing k = 2, d = 4 automata with 
single-cell initial conditions. Bottom: The first 6 x 10^ steps of rule 
7403, sampled every 128 steps, illustrate the explosion of boundary 
growth at step 524557. 



BOUNDARY GROWTH IN ONE-DIMENSIONAL CELLULAR AUTOMATA 



21 




Figure 10. Steps 524000 through 534001 of rule 7403, broken up 
into two cohimns. After growing to just 174 cells wide in the first 
524000 steps, the automaton begins to grow much more rapidly at 
t = 524557, reaching length 1429 at time t = 534001. 
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Figure 11. Left: A cellular automaton that squares integers, 
shown here squaring i — 1 ^ 6. Right: A cellular automaton with 
no growth exponent, shown for 128 steps and 1152 steps. 



that the limiting growth exponent 

lim log,£(i) 

t—^OD 

does not exist. 

The idea is to take rule 106 begun from • • • IZlZlZ^BZnill • • • (shown in Figure [2]), 
which grows like and to graft onto it an automaton that roughly squares the 
length of a row. We set up the squaring rule to be activated at certain steps, causing 
the sequence £{t) to grow to be on the order of t, and then we allow it to fall back 
to the boundary of rule 106 on the order of y/i before being activated again. As 
a result, the sequence £{t) oscillates between square-root growth and linear growth 
and satisfies 

hminf logj ^(t) = -, lim sup log^ f (i) = 1. 

t-foo 2 t-i-oo 

A squaring rule that works by repeated addition was given by Wolfram [151 
page 639] using fc = 8 and d = 3. Begun from the initial condition 

113000- •• , 




this rule produces a row of length i — £ after 3£ — 5£ steps. Figure 11 shows the 
automaton squaring the integer 6. 

A fci-color rule and a fc2-color rule can be combined into a single (fciA:2)-color 
rule that can be thought of as their direct product and that can run the two 
rules in parallel. Since of course we do not want the two rules to run completely 
independently, we modify the composite rule so that there is some interaction. 
In particular, modifications to the squaring automaton, including the addition of 
one color, inhibit future squarings until the current squaring is finished and the 
automaton has shrunk to the width of rule 106. Hence our composite rule uses 
2 X 9 = 18 colors. The broad outline is as follows. 

Step 3 in rule 106 consists of four black cells. We choose the initial condition so 
that the squaring automaton is first activated on step 4. Since the squaring needs 
to be activated locally, we modify the squaring automaton so that it squares a row 
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using only information from its two endpoints rather than from the entire interval of 
cells. The relevant interval for squaring on step 4 has length ^ = 5, so the squaring 
automaton takes — 5£ = 50 steps to square. From the time the squaring begins 
until it finishes, the squaring automaton runs independently of rule 106. 

After the squaring completes at step 54, we must clear the cells used by the 
squaring automaton. To do this, we add a new color to mark the leftmost nonempty 
column. When the last addition is complete, we send out a particle from this column 
that travels to the right and clears the cells involved in squaring. 

When the clearing particle reaches the rightmost remnant of the squaring au- 
tomaton, we trigger a particle traveling back to the left to signify that the next 
squaring can begin. When this particle first encounters a structure from rule 106, 
it stops propagating to the left and remains in that column to trigger the next 
squaring when rule 106 next has two adjacent black cells at the right endpoint, and 
the process begins again. 

The result is a rule with fc = 18 and d = 4, begun from the initial condi- 



tion • • •0002899003000- • • . Figure 11 shows two images of this automaton. The 
complete rule instructions are available in CellularAutomatonData [6-. Even 
though both rule 106 and the squaring automaton are functions of 3 cells, it is 
necessary to shear one of the rules relative to the other to bring their structures 
into alignment, hence d — 4. 

We now verify by induction that triggering the initial squaring on step 3 enables 
one to easily determine when all future squarings will occur. (For other initial 
triggering steps this is not the case.) We claim that squarings are triggered precisely 
on steps 2'^"+2 - 1 for a > 0. 

For a = the squaring at step 3 is guaranteed by our choice of initial condition. 

Inductively, assume that a squaring is triggered on step 2*°'^^ — 1 for a fixed a. 
On step 2^*"+^ — 1, rule 106 has a solid black row of length 3 • 2^" -I- 1. The squaring 
rule begins squaring 3 • 2^" -I- 2 on the following step and reaches maximum length 
9 . 24" + 9 . 2^" + 5 on step 31 • 2-*" + 21 • 2^" -|- 2. The length is maximal for three 
steps, and then the length shrinks one cell per step until the particle reaches the 
boundary of rule 106; this occurs at step 5 • 2'^"'^^ + 9- 2^"+^ -|-3, because the length 
of rule 106 is 3 • 22("+i) + 1 for aU 24"+5 < t < 2^°'+^ - 1, and one checks that 
2ia+5 < 5 . 24a+3 _^ g . 22a + l _^ 3 <- 24a+6 _ 2. For 24"+^ < t < 2^°^+^ - 2, the right 
endpoint of rule 106 is a single black cell, and the next occurrence of two adjacent 
black cells is on step 24("+i)+2 - l. 

It follows that the subsequence of the steps where squarings begin has limiting 
exponent 

log(3 • 22" + 1) 1 

hm — ; — ; — -, 

o->oo log(24"+2 - 1) 2 
and the subsequence of the steps where squarings end has limiting exponent 

lim • 24" + 9 • 22" + 5) _^ 

log(31 • 24" + 21 • 22" + 2) ^ 

6. Conclusions and open questions 

In this paper we have inventoried the boundaries of all cellular automaton rules 
using k — 2 colors and depending on d = 4 cells when begun from a single cell on 
a constant background. Within this rule space we have encountered several kinds 
of behavior not seen in smaller spaces. In particular, we find fractal boundaries 
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described by morphia words. By studying the unpredictable boundaries as if they 
were random walks, we find approximately exponential distributions of the mean 
and variance of the boundaries' growth and a possible separation into two classes 
of automata, ones that grow quickly and erratically and others that grow slowly 
and more deterministically. 

For simplicity, we have restricted our attention in many ways. We have only 
considered the two initial conditions • • • I I I M I "Tl • • • and • • • ^HZ^H • • • . A 
more general study of A; = 2, d = 4 rules will consider other initial conditions 
and attempt to determine to what extent each rule has a representative growth 
rate. More generally still, one can consider initial conditions with backgrounds 
that are not constant but are periodic, because there still exists a natural notion 
of the length of a row. Finally, the rule space we studied is big, but it is not 
huge, and one can imagine performing similar analyses on larger spaces of cellular 
automata. We hope that researchers in fact do all of these things, and we have 
designed CellularAutomatonData to scale to these more general settings. 

Another topic to be addressed is the issue of distinct automata that nevertheless 
generate the same evolution (or an evolution equivalent under reflection or permu- 
tation of colors) because certain local configurations of cells do not appear. For 
example, rules 34394 and 39780 can generate identical evolutions, as mentioned in 
Section |3.3[ At the beginning of Section [4] we encountered this phenomenon again. 
(Although we did not mention it earlier, among the 688 distinct evolutions gener- 
ated by the 720 irreducible automata there appear to be only 658 distinct boundary 
words.) The prevalence of equivalent evolutions generated by inequivalent rules sug- 
gests that one should use more complex initial conditions to distinguish such rules. 
One possible criterion for a representative initial condition for a given rule is that 
all fc'' local configurations that can (for some initial condition) occur infinitely often 
in an evolution do occur infinitely often. 

This paper concerns external boundaries, which are simply special cases of gen- 
eral boundaries between distinct regions of a cellular automaton's evolution. The 
advantage of studying external boundaries is the ease of defining and therefore 
programmatically detecting them. However, internal boundaries (or particles) are 
common in automata, and several information-theoretic measures have been used 
to detect them [161 El UHl US] and their collisions '20] . We expect that our au- 
tomated and manual methods could inform a study of general boundaries. Con- 
versely, information-theoretic tools for internal boundaries may be applied to ex- 
ternal boundaries to systematically measure, for instance, how much they store and 
process information [ST] [52] . 

In most cases, the cellular automaton data we computed is empirical and has 
not been formally proved to be correct. (We welcome any corrections.) Of course 
ideally we would like to have proofs. The automata with morphic boundary words 
that are not eventually periodic are few enough, at least in the space k — 2, d = A, 
that it is reasonable to attempt to prove manually that each boundary word is de- 
scribed by the morphism claimed. On the other hand, for the 24287 automata with 
eventually periodic boundary words, obtaining proofs by hand is not reasonable, 
and one must develop automated techniques for examining a rule and initial condi- 
tion to determine (rigorously) the growth rate and the eventual period length. Of 
course, the question of whether the boundary word is eventually periodic is likely 



BOUNDARY GROWTH IN ONE-DIMENSIONAL CELLULAR AUTOMATA 



25 



undecidable in general. However, a symbolic approach capable of proving a large 
number of growth rates would be of great interest. 

From the results in this paper, several natural questions arise regarding the 
growth of cellular automata. 

• Which morphic words occur as the boundary word of a cellular automaton? 

• For what real numbers < 6 < 1 is there a cellular automaton with lim- 
iting growth exponent b — lim(_j.oo logj i?(t)? Schaeffer |23j has recently 
constructed cellular automata with row lengths that grow like t^^"^ for any 
integer m > 3, and where = (1 + v%)/2. 

• Schaeffer [23] has also constructed an automaton with £{t) = 0{y/t\ogt). 
What can be said in general about possible and impossible growth func- 
tions? 

• How does the growth of an automaton depend on k and d? For example, 
what is the smallest nonzero rational growth rate that occurs for given k 
and dl 

These and other questions indicate the breadth of mathematics and experimenta- 
tion to be done on the boundaries of cellular automata. 
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